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Generalized Dirac bracket and the role of the Poincare symmetry 
in the program of canonical quantization of fields 1 

Marcin KazmierczalQ 

Institute of Theoretical Physics, University of Warsaw ul. Hoza 69, 00-681 Warszawa, Poland 

An elementary presentation of the methods for the canonical quantization of constraint systems 
with Fermi variables is given. The emphasis is on the subtleties of the construction of an appro- 
priate classical bracket that could be consistently replaced by commutators or anti-commutators 
of operators, as required by canonical quantization procedure for bosonic and fermionic degrees of 
freedom respectively. I present a consequent canonical quantization of the Dirac field, in which the 
role of Poincare invariance is made marginal. This simple example provides an introduction to the 
Poincare— free quantization of spinor electrodynamics in the second part of the paper. 
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I. INTRODUCTION 



The canonical quantization scheme can be briefly described as follows: to find a quantum version of a given classical 
theory, one formulates this classical theory in the Hamiltonian framework in terms of Poisson brackets. Then the 
\ classical dynamical variables are promoted to the operators and the Poisson brackets to the commutators (up to the 
Q-r factor ih). The space of states is obtained by looking for representations of the commutation relations of physically 
important observables in a Hilbert space. 

The usefulness of these method of quantization was evident from the very beginning of quantum theory. The historic 
monograph by Dirac [l[ contains a beautiful presentation of the underlying motivations. With the realization that 
the theories of particles, both massless and massive, need to be viewed as quantum field theories, the attempts of 
- 1 ■ canonical quantization of field-theoretical systems began. Although the extension of the formalism to the uncountable 
\ number of degrees of freedom did not cause much trouble (except for the usual difficulties in mathematically rigorous 

■ formulation), there were two other features of physically relevant field theories that were problematic. 

[*""». \ One of them was related to the occurrence of degenerate Lagrangians that lead to the presence of constraints in the 
ON • Hamiltonian formalism. These problems were partially resolved already by Dirac It appeared that the constraints 
may be of two kinds. One of them leads to the presence of gauge freedom in a physical system and the other to 
the necessity of replacing the conventional Poisson bracket by the new classical bracket (the Dirac bracket) in the 

■ quantization procedure. 

The second problem was bound up with the relation between spin and statistics. As explained in classical refferences 
[H, the field operators for particles with half-integer values of spin that are evaluated at spatially separated space-time 
points should anti-commute, rather then commute. This poses a problem for the canonical quantization scheme, since 
^ . both the Poisson bracket and the Dirac bracket are antisymmetric in their arguments and hence cannot be consistently 
k^J replaced by symmetric anti-commutators of operators. The solution to this problem was proposed by Bellinfante at 
all. [J . The methods discussed there acquired a rigorous mathematical formulation in the papers by Casalbouri [5] [6] . 
It appeared that it was necessary to introduce two kinds of classical variables, whose multiplication is not necessarily 
commutative. Specifically, the multiplication of the so called even-type variables with all the others is commutative 
and the multiplication of odd-type ones between themselves is anti-commutative. After these Grassman variables are 
introduced, it is possible to define the generalized Poisson bracket that is symmetric whenever both variables are odd 
and anti-symmetric otherwise. Also, the bracket possesses other important algebraic properties. The introduction of 
Grassman variables in the presence of the constraints is discussed in |6| . 

More recently, the discussion of constrained systems with Fermi variables can be found in Q and [10], but the 
authors of these references decided to focus on general considerations, rather than the applications of the formalism 
they developed. Handbooks of quantum field theory either restrain from discussing the canonical quantization of 
constrained systems at all [8| in favor of path integral approach, or present the discussion of the Dirac bracket that 
is relevant for bosonic systems only [3j . The anti-commutation relations for the Dirac field are then derived from the 
abstract group theoretical arguments that invoke to the Poincare symmetry of space-time and the assumed Lorentz 
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transformation properties of the Dirac field, as well as the discrete symmetries and the causality arguments. The 
canonical anti-commutation relations are then not necessary. Although this approach is indisputably elegant, it has 
a failure of not being expendable to the case of curved space-time, which does not posses the Poincare symmetry. 
On the other hand, the advantage of the canonical quantization is that the Poincare symmetry does not have to 
be employed at all. Not only the space-time metric does not need to be flat - it is not necessary at all in the 
quantization procedure. Indeed, much of the motivations underlying the development of canonical methods was 
related to the attempts to quantize gravity in a background independent way. See e.g. |9] for the presentation of one 
of the advanced manifestations of these attempts. One of the points raised in this reference was the prevalence of 
the Poincare symmetry and the frequent usage of the Minkowski metric in standard quantum field theory (see p. 3 of 
Q), e.g. when formulated in terms of Wightman axiomatics. The author finds these properties of QFT an important 
obstacle to the straightforward extension to the gravitational case. One of the aims of my article is to show that the 
Poincare symmetry and the Minkowski metric need not be used almost at all, if the canonical quantization scheme 
is consequently followed (the meaning of almost will become clear during the presentation). In this first paper of the 
series I will review the general formalism for canonical analysis of constraint systems with Fermi degrees of freedom. 
Than I will focus on the theory of the free Dirac field, which is the simplest physically relevant example containing 
both the difficulties that were mentioned. In the forthcoming article the electromagnetic interaction will be turned 
on and then the issue of gauge invariance and gauge fixing will be addressed in more details. 



II. THE CANONICAL ANALYSIS OF CONSTRAINED SYSTEMS WITH FINITE NUMBER OF 

DEGREES OF FREEDOM 



A. Classical mechanics of constrained systems 



Our starting point is a classical theory in a Lagrangian formulation, whose equations of motion are defined by the 
stationarity condition for the action functional 

%(*)] = f L(q,q)dt , (III) 

where q = (q 1 ,...q N ) represents the positions. In order to pass to Hamiltonian formulation, one defines canonical 
momenta as functions of positions and velocities 

BT 

Pn~Q^(q,q), n = l...N {11.2) 

and the canonical Hamiltonian 



H := Pn q n - L (II.3) 

(here and further in the article the Einstein summation convention applies whenever the indexes repeat, unless 
otherwise stated). Although H is originally a function of positions and velocities on account of (111.21) . it can be 
expressed as a function of positions and momenta instead, since its variation 

/ Pit \ Pit Pit 

5H = [ Pn - w) sr + rdPn " w 5qn = r6pn " w Sqn (IL4) 

depends on 5q only through the combinations 5p n = g „/q^„ Sq n + d ° n iQ^ n ^Q n ■ An expression for H as a function of 
the q's and the p's, although always exists, needs not be unique. To find one in practice, one could wish to calculate the 
velocities as functions of positions and momenta from (|II.2[) and insert the result into (|II.3p . However, the condition 
for this to be possible for all the q's is that the determinant of the matrix d 2 L/dq n dq n does not vanish. If this 
condition fails to hold, which is the case for many systems of direct physical importance, then one cannot compute 
form pi.2[) all of the q's as functions of the q's and the p's. This is however not necessary, since in this case (|II.2|> 
implies some relations between the q's and the p's of the form 

<t> m (q,p) = 0, m = l,...,M (II.5) 

which make the dependence of H on the remaining velocities vanish. These conditions, called the primary constraints, 
define a submanifold in phase space, called the primary constraint surface. The canonical Hamiltonian is well defined 
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only as a function on this surface. It is therefore allowed to use (|IL5[) when expressing H as & function of the q's and 
the p's, thus obtaining many equivalent expressions for H. 

The Hamilton equations of motion that are equivalent to the Euler-Lagrange equations of (jll.lj) are given by 

.„ dH . m d(j> m 
1 



dp n dp„ ' 

Pn = _9H_ u m^m (H.6) 
™ dq n dq n ' 

4>m = 0, 

where the it's are functions on phase space whose dependence on the q's and the p's should be determined in such 
a way that the system of equations pi.6[) have a solution (if this is not possible than it means that the system of 
Euler-Lagrange equations of was contradictory) . See [7| for the proof of the equivalence of ((II. 6[) and the 

Euler-Lagrange equations. For our purposes it is important to note that the Hamilton equations can be rewritten as 



where 



q n = [q n ,H T } P , p n = [p n ,H T } P , <$> m = 0, (II.7) 



H T :=H + u m (j> m (II. 



is called the total Hamiltonian and [,]p stands for the usual Poisson bracket (PB) which is defined for any dynamical 
variables F(q,p) and G(q,p) by 

dF dG dG OF 
[FMp -Wdp- n ~WWn (IL9) 

Note that it is not necessary to calculate the PB between u's and the q's and the p's in (|II.7j) . since all the brackets 
containing u's will be multiplied by the constraints, and thus this components will vanish on account of the last 
equation of pi.7[) . The evolution equations for any dynamical variable F(q,p) can now be expressed in an extremely 
simple form 

F^[F,H t ]p, (11.10) 

where the weak equality symbol w means that the equality holds if the constraints are imposed on the final form of 
the expressions (note that the constraints cannot be imposed on Ht before its Poisson bracket with F is calculated!). 

In order to quantize the theory, one needs to be aware of all the constraints, i. e. it is necessary to find all 
the independent relations of the form C(p, q) = that are satisfied at any instant of time. In general, the primary 
constraints <j> m will not provide a complete set of such relations. Additional constraints can follow from the requirement 
that </> m 's are preserved in time: 

0m ~ [<pm, Hp]p ~ [4>m,H]p + U m [(j) m , (j) m ']p ~ 0. (11.11) 

For e given m, this equation can provide a restriction on u's (which happens whenever there exists a constraint that 
does not commute with <j) m ), or it can yield another constraint, which may be independent of (j) m 's or not. If an 
independent constraint is obtained, one should require it to be preserved in time as well. This can yield yet another 
independent constraints or farther restrictions on u's. The procedure ought to be continued until all the constraints 
are found. The additional constraints obtained in this way are called secondary. Following Q, I shall denote the set 
of all the constraints by 

4>j, j = l,..-,J, (IL12) 
where J — M is the number of secondary constraints. At the end, these constraints need to satisfy 



H] P + u m ■ [<t>j,<f> m ']p ~0 (H.13) 



(if they don't, more constraints or restrictions on u's are needed). At any fixed point (p, q) of the phase space, (jll. 13|) 
can be viewed as a system of linear equations for u's. The general solution is provided by 

u rn = U ra + yrn ^ yrn = ^ym ^ (jj U ) 
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where U is a particular solution of the inhomogeneous system, V is the general solution of the homogeneous part, V a 
constitute an arbitrary basis for the space of solutions of the homogeneous equation and v a are completely arbitrary 
functions of time. 

A dynamical variable F{q,p) is called first class if it commutes 1 with all the constraints. It is easy to verify that a 
modified Hamiltonian 

H' = H + U m <p m , (11.15) 

called the first class Hamiltonian, is first class. Note that U is a particular solution of (|II.13j) which can be chosen in 
a definite form, hence there are no arbitrary functions in H' , although one can construct H' in meny different ways 
by choosing different particular solutions U. The total Hamiltonian (|II.8|) that determines the dynamics of the system 
can now be rewritten in the form from which some of the arbitrary functions have been eliminated 

H T = H' + v a <j> a , cj> a := V™<t> m . (11.16) 

The arbitrary functions v a remain present in the final form of the dynamical equations and indicate the presence of 
gauge freedom in the system, as explained below. 

All the constraints can be separated into first class and second class constraints 2 , which I shall denote by j a and Xa 
respectively. There are many possible realizations of the system of constraints in a given theory, all of them defining 
the same submanifold in phase space. For example, one can add the constraints, multiply them by functions or insert 
them as arguments of functions that vanish at zero 3 . Performing that kind of operations can change the total, as well 
as relative number of first and second class constraints. Indeed, adding a second class constraint to the first class one 
will result in a second class constraint. In fact, all the constraints could be made second class in this way, but this is 
not what we wish to achieve. I will say that the constraints are well separated into first and second class ones if the 
number of second class constraints is made minimal. 



B. Gauge freedom 

In classical physics, the time evolution of a system is expected to be deterministic. It appears that imposing 
such an assumption on a system with first class primary constraints leads to the conclusion that there is no one to 
one correspondence between points on the constraint surface and physical states. To see this, consider an arbitrary 
dynamical variable F(t) = F(q(t),p(t)) whose dependence on t is analytic and whose value at an instant of time to is 
well established. The value of F at time t = to + r will be 

T 2 

F(t + r) = F{to) + rF{to) + yf(t ) + . . . (11.17) 

Using F = [F, Ht], F — [[F, H T ],H T ] and pLT6| one obtains 

F(* + t)=F + t ({F, H'\ + v a [F, 4> a ]) 

t 2 r i ( IL1 8) 

+ — [[[F,H'],H']+v a {2[[FMH'] + [F, [H'M) + v a v a [[F, </> a ], <f> a -]} + o(r 3 ), 

where on the RHS all the time dependent quantities (the functions v a and the brackets) should be evaluated at to- 
Now the functions v a can be prescribed arbitrarily. By adopting an alternative set of functions, say v a , one gets 
different value F(to+r) of the dynamical variable F at to + r. Clearly, this difference cannot be physically meaningful 
if time evolution is to be deterministic. Rather, the difference should be interpreted as gauge freedom on which no 
measurable physical quantity should depend. Up to linear terms in r, this difference is simply 

SF(t + r) = F{t a + t) - F{t + r) = T Sv a (t )[F, o ](i<,), Sv a := v a - v a . (11.19) 



1 Commutation means vanishing of the Poisson bracket in the case of bosonic variables and vanishing of the generalized Poisson bracket 
(discussed later) for fermionic ones. 

2 A phase space function is called second class if it fails to satisfy the first class condition. 

3 If the constraints are required to satisfy some regularity conditions (see 1.1.2 of pj), than the set of possible operations that can be 
performed on the constraints is reduced to addition and multiplication by functions, since then the theorem holds that states that any 
function that vanishes on the constraint surface is necessarily a linear combination of the constraints (theorem 1.1 of 
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The infinitesimal form of the transformation, pi.l9[) . justifies the statement that the gauge transformation corre- 
sponding to the change in a particular function v a (for fixed a) is generated by the corresponding constraint (j> a . 
The primary constraints <j) a . as defined in (jTLlo) . are first class, since for any constrain <pj one has [<j)j,<p a ] — 
[0j,V^ m ]0 m + V™[<f>j,4> m ] ~ 0, where the vanishing of the last term follows straightforwardly form the definition 
of V a 's as solutions to the homogeneous part of the system (|II.13[) . The conclusion follows that first class primary 
constraints generate gauge transformations. Consequently, if a dynamical variable F is to represent an observable, it 
should commute with </> Q 's. This guarantees that F does not change under gauge transformations up to terms linear 
in r (see (IIL19[) ). However, in order to guarantee the invariance of F up to second order terms in r, it is necessary 
to assume additionally that F commutes with [<fi a ,H'] (look at the terms proportional to r 2 in (JTT7T8J) ) . Although 
[4> a ,H'] is certainly a first class constraint 4 , it does not have to be spanned by primary constraints. It follows that 
F may in general need to commute with some secondary first class constraints, in order to be gauge invariant. If 
a dynamical variable commutes with all the first class constraints, both primary and secondary, than it is called 
a classical observable. Such observables are gauge invariant up to any order in the expansion (|II.18|) and thus can 
describe physically measurable quantities. 

Evolving the basic dynamical variables q and p from an instant to to some instant t with different choices of the 
arbitrary functions v a will yield a collection of points of phase space (q(t) , p(t)) , all of them describing the same physical 
state of the system at the instant t. The points in this collection are related by gauge transformations generated by 
the first class primary constraints. However, there may exist other points that also describe the same state, if there 
are secondary first class constraints present. In the cannonical approach to the quantization of constrained systems it 
is common to assume that all the first class constraints, both primary and secondary, generate gauge transformations, 
although this assumption may fail to be true for some special systems (see the counterexample to the Dirac conjecture 
in Q). To make all the gauge freedom manifest in the equations of evolution, the so called extended Hamiltonian is 
adopted as a generator of the dynamics 

H E = H' + w b lb , (11.20) 

where b numbers all the first class constraints and w b are arbitrary functions (note that He differs from Ht by the 
presence of secondary first class constraints). It is the dynamics generated by Ht, and not He, that is equivalent to 
the Euler-Lagrange equations of II) . However, it is not difficult to see that the dynamics of any classical observable 
does not depend on the choice between Ht and He- The general comparison of the evolution equations generated by 
Ht and He can be found in Q- I will just illustrate the difference on the example of electrodynamics in the second 
part of this article. In fact, the reader does not have to bother by the discrepancy between Ht and H E and the 
philosophy underlying the preferential use of H E over Ht in the canonical analysis. In the gauge fixing approach to 
the quantization, which will be ultimately adopted in the second part of this paper, it does not matter which dynamics 
is utilized. 

From the viewpoint of quantum theory, what we wish to extract from classical Hamiltonian analysis are the 
commutators between the physically important dynamical variables. The appropriate classical bracket acting on pairs 
of phase space functions should be identified that will then be replaced by (anti)commutators. In the absence of 
constraints and fermionic degrees of freedom, the Poisson bracket does the job. But it is not consistent neither with 
the constraints, nor with the presence of fermions (an antisymmetric structure cannot be consistently replaced by the 
symmetric anti-commutator). I shall now define the Dirac bracket, which is the modification of PB needed to handle 
the constraints in the absence of fermions. Later, the concept of the generalized Dirac bracket will be introduced that 
allows for inclusion of fermionic degrees of freedom. 

C. The Dirac bracket 

Assume that the set of all the constraints is well separated into the first class constraints 7^ and the second class 
constraints xp- The Dirac bracket (DB) of the dynamical variables F and G is defined by 

[F,G]d = [F,G] P - [F, Xp } P C^'[ X 0',G}p, (11.21) 

where represents the inverse matrix to Cppi := [x/3,X/3'}p, i- e - Cpip» — 5^,, (the matrix Cppi is necessarily 
invertible, since otherwise one can construct a first class constraint from xp' s , which means that the constraints were 



4 It is first class because the bracket of first class functions is of first class. This follows straightforwardly from the Jacobi identity. The 
bracket [4> a ,H'\ is also a constraint, since H' is first class and hence its bracket with any constraint vanishes weakly. 
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not well separated. See Theorem 1.3 of |7|). It is easy to check that the Dirac bracket is anti-symmetric, obeys 
the Jacobi identity and Leibniz rule. Hence, the Dirac brackets can be replaced by commutators when passing to 
quantum theory in a consistent way. What is an advantage of DB over PB is that DB of any dynamical variable with 
a second class constraint vanishes. This allows for interpreting second class constraints as strong operator equations 
if a theory is quantized by the replacement of the Dirac brackets by the commutators of operators (the commutator 
of any operator with zero needs to vanish). Also, the DB of any variable with a first class variable is equal to the PB 
and therefore the equation of motion (III.10[) can be rewritten as 

F w [F, H' + v a (f> a ]DB- (11.22) 



D. The anti-commuting Grassman variables and the generalized Dirac bracket 

In the quantum theory of fields, the field operators that describe particles of integer value of spin can be characterized 
by appropriate commutation relations, whereas those of half integer spins obey anti-commutation relations. This 
distinction follows basically from the postulate of the Poincare invariance, the assumption that the fields ought to be 
expressible as weighted integrals of annihilation and creation operators, and the requirement of causality. See Q for 
more detailed justification of the connection between spin and statistics. 

As mentioned in the previous subsection, it is not possible to replace consistently the Dirac brackets of basic 
canonical variables by anti-commutators, since the two structures have incompatible symmetry. What is needed is 
the classical bracket that is antisymmetric if at least one of the variables is bosonic and symmetric if both the variables 
are fermionic. In order to obtain this structure it is necessary to introduce the two kinds of variables already at the 
classical level. I shall assume that the classical variables can be even (bosonic) or odd (fermionic), or they can be a 
sum of those. When these Grassman variables are introduced, their multiplication is no longer commutative. An even 
variable commutes with all the others, whereas odd variables anti-commute with one another. Let q l , 9 a constitute 
the set of positions of even and odd type, respectively. The multiplication rules are 

q i q j - q j q i = 0, 6 ot q i - q l 9 a = 0, 9 a 9 p + 9 p 9 a = 0. (11.23) 

The only functions of the positions which I shall consider will be analytic in the odd variables 

f(q, 0) = fa(q) + fa(q)0 a + f a p(q)9 a 9? + ... (11.24) 

Any such function is a sum of an even part Je and an odd part fo- 

f(q,0)=f E (q,6) + fo(q,0), 

f E (q, 0) = f (q) + f a p(q)0 a 0? +... (11.25) 
fo(q, 0) = f a (q)6 a + f aPl (q)9 a 9^9^ + ... 



1. Differentiation with respect to odd variables 



Under the infinitesimal variation of odd variables, the function (|II.24|) changes as 

d L f d R f 

Sf = S9 a —^ = ^S9 a . (11.26) 
d9 a d9 a K ' 

The above equation should be considered as definition of right derivatives and left derivatives with respect to odd 
variables. From (|II.24|) it is clear that the derivative of an odd function is even and vice versa, and hence the relations 
follow: 

d^fE^^d^fE d^fo^d^fo , , 

d9 a d9 a ' d9 a d9 a ' 1 ' ' 

Let us consider an example of a function represented by a series that cuts off at the third order term in odd variables: 

f(q,9) = f (q) + f a (q)9 a + f a p(q)9 a 9^. (11.28) 
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The derivatives are then given by 



8 L f B R f 

30T = A - 2 f^e a + 3f af3x 9 a 9P, -^=h + 2f aX 9 a + 3f afiX 6 a 9^. (11.29) 
Variating this expressions with respect to 9 yields 

= S0K (~ 2fnX + 6f *^ = H /kA - 6 f*^ 0P ) ^ > 

from which the second derivatives with respect to odd variables can be read out. It follows that 



(11.30) 



d^_ (d R f\ = (d^£\ d^_ (d L f\ = _d^_ (d L f\ d^_ (d R f\ = _ d*_ (d R f\ 

86* V 99 x J 86 x V 06* ) ' 86* \ 86 x J 86 x \ 86* ) ' 86* V dd x ) 86 x \ 86* J ' 1 ' ' 



The last two equalities imply that 



8 L [8 L f\ 8 R (8 R f 



86* V 86^- J 86* \ 86- 



0. (11.32) 



Here the bar below k means that Einstein summation convention is not applied, i.e. k is a fixed value. The identities 
(|II.31[) and (|II.32[) can be proved by induction to hold for any function of the form f|II. 24[) and are necessary to prove 
Jacobi identity, to be discussed below. It is also easy to see that the derivatives with respect to even variables commute 
with those with respect to odd ones. 

2. Harniltonian formalism in the presence of odd variables 

The action of the theory containing both even and odd positions is of the form 

S = J L(q,q,9,9)dt, (11.33) 

where the Lagrangian is assumed to be even (the time derivatives of even/odd variables are obviously of the same 
type as the original variables). The equations of motion that follow from the stationary condition for this action are 



8L d ( 8L\ 8 L / R L d fd L ' R L 



8q l dt \dtf J ' 86 a dt \ 86 



(11.34) 



where L/ R means that either type of the derivative can be used, if it is the same for both sides of the equation. The 
canonical momenta are defined by 

8L T 8 L L p 8 R L 
Pl := — , tt£ := — ^, tt£ := — ^. 11.35 

8q 1 ' 86 a 86 a 

Since the Lagrangian is even, it follows that ir^ = —tv r . Finally, the canonical Harniltonian is given by 

H = fa + 9 a rt -L = v4 + tt r 6 a - L. (11.36) 

Note that one can use either left or right momenta, but the order of factors in the terms 6 a ir^ — —^6 a and 
tt r 6 = —6 a ir R has to be chosen correctly in order to avoid minus signs. 

It is also allowable to use left momenta for some odd variables and right momenta for the others. Such a mixed choice 
will appear to be particularly convenient in the case of the Dirac field, where I shall choose to right-differentiate with 
respect to the components of the field ip and left-differentiate with respect to the components of its Dirac conjugate 
ip. Before turning to the Dirac field in the next section, I shall now consider a theory with finite number of degrees 

of freedom, defined by the Lagrangian L (q,q,9,9,9,9j . I do not assume anything about the relation between the 
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positions and 0, but I choose to right-differentiate with respect to O's and left-differentiate with respect to 0's. The 
momenta and the Hamiltonian are given by 

_ BL R _ 8*L _ __ L _ B L L 

9q' B6 a Q6 (11.37) 

H= Pt q t + nJ a + t*W a -L. 

Variation of the Hamiltonian is given by 

— BT B R T — B L T 

5H = 5 Pl q l + StvJ" + 6 a 5if a - —5q l - —56 a - 86 , (11.38) 

Bq L B6 a B6 

where the definitions of the momenta (|II.37[) where used in the calculation. Also, the variation of H interpreted as a 
function of positions and momenta is 

,n - + + + + ( „. 39) 

Bq l Bp, 39 a B6 oir a dw a 



3. Unconstrained systems: the generalized Poisson bracket 

If the equations (|II.37[) can be used to express velocities in terms of positions and momenta in a unique way and 
do not lead to any relations between positions and momenta, then the positions and momenta are independent and 
hence their general variations are linearly independent. It then follows from (|II.38|) and (|II.39[) that 

l= BH , a ^dH_ BL BH BL_ _BH_ BL_ _BH_ 

1 Bp, 1 dir a ' Bn a ' Bq* dq*' B6 a B6<*' bT BT' ( 1 

Here and below the derivatives with respect to 0's and 7f's are defined as right derivatives, whereas those with respect 
to 6's and it's are left derivatives. In order to rewrite these equations in a form that does not include the velocities, 
the relations following from the Euler-Lagrange equations (III.34[) 

BL . BL . BL ^ 

W =Pi > Be^ = na ' W = * a (IL41) 

need to be used. When inserted into pi. 401) . they lead to 

■i dH aa dH > BH . BH . BH j_ BH 

q= Bp~' 6 9 = Pi = ~W na = -d6^' * a = -W (IL42) 

These Hamilton equations are equivalent to the Euler-Lagrange equations (]II.34[) . The time derivative of any dynam- 
ical variable F(q,p, 9, 6, it, If) is given by 

• BF . BF . BF ■ > BF . BF BF ■ 

F= W q+ op~ Pl + + 6 W + ^ + 

_ BF BH BF BH BF BH BH BF BH BF BF BH [ ' ' 

~ WWi ~ dplW + 86^ + dW^BT ~ d0^d^~ ~ dW^W 

In quantum theory, the dynamical variables will be replaced by operators. In order to accomplish the canonical 
quantization, we need to find the classical bracket [, ]gp, which I shall call the generalized Poisson bracket (GPB), for 
which the relation 

[F,G] T =z[^G] GP (11.44) 

will hold (the units in which c = h = 1 will be used throughout), where F and G are the classical dynamical variables 
and F, G the corresponding operators. The operators that arise from even classical variables via the map will 
be called bosonic, whereas those arising from odd variables fermionic. The bracket [F, G]^ should be interpreted as 
commutator [,]_ if at least one of the operators is bosonic. If both the operators are fermionic then [, ] T denotes 
anti-commutator [,] + . If the opearators do not have definite parity then they can be expressed as sums of those with 
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well established parity and then the linearity of should be used 5 . I wish that the time derivative (|II.43|) of any 
dynamical variable expresses through the classical bracket as F — [F, H] G p, which implies on account of pi.43[) that 

_ dF dE dF dE dF dE dE dF dE dF dF dE 

where E is any even dynamical variable (not necessarily the Hamiltonian) and the variable F is completely arbitrary. 
Now (|II.44[) implies that [,}gp needs to have the same algebraic properties as This means in particular that it 
is anti-symmetric whenever at least one of the variables is even and hence 

[E,F] GP = -[FE] GP . (11.46) 

The formulas pi.45[) and pi.46[) specify GPB in the case when one of the variables is even. It only remained to find 
GPB in the case when both the variables are odd. If A, B and C are odd variables, then pi.44[) implies that GPB 
should satisfy two algebraic conditions 

[A,B] G p = [B,A} GP , [A,BC] G p = [A,B] GP C-B[A,C] G p. (11.47) 

The latter one follows from the operator identity [A, BC}- = [A, B] + C — B[A, C]+ and from the fact that the product 
of two odd operators is even. I make an assumption that the classical bracket is composed from the products of partial 
derivatives with respect to basic canonical variables. More precisely, I assume that 

rin , dAdB dAdB dA dB dB dA dB dA dA dB , . 

[A > b]gp = c WW> + + a d^d^ a + f Wd^ + b d¥~ a W + g d¥~ a W (IL48) 

for some complex numbers c, d, a, f, b, g, to be determined from pi.47[) . In the calculations that will follow it 
is important to remember that differentiation with respect to even variables does not change the type of parity, 
whereas differentiation with respect to odd variables reverses the parity, so e.g. dA/dq 1 is odd and dA/d8 a is even. 
Remembering this, it is straightforward to verify that 



[A,B] GP -[B,A] G p= (c + d) 



dA dB dA dB 
dq l dpi dpi dq l 



, .dAdB dB dA\ n J dA dB dB dA^ {UA ' >] 



dO a dit a d9 a dir a J v \dW a dT dlf a d8 
If this expression is to vanish for any odd variables then it has to be d = — c, / = a, g = b. It follows that 

„. {dAdB dAdB\ {dAdB dB dA \ f dB dA dB \ . . 

[A > b]gp = c [d^Wi ~W l W) +a \d9^d^ + d9^d^ a ) + b + d¥~ a W ) ■ (ILo0) 

The remaining freedom of the parameters c, a, b is eliminated when the second algebraic condition in (|II.47|) is 
imposed. To calculate [A,BC] G p one should use pi. 451) for F = A and E = BC (note that BC is even as a 
product of odd variables and hance pl.45j) applies). The derivatives of BC with respect to even variables can be 
decomposed according to the standard Leibniz formula, but some care is necessary when differentiating with respect 
to odd variables. Specifically, the following relations hold 



d{BC) 
d9 a 


™CA 
d9«^ 


dC 
d6<*' 


d(BC) 
dT 


dB 

~w c ~ 


dC 


d{BC) 


--C- 




d(BC) 


= —C- 




dTT a 


dW a 


dW a 


dn a 


dir a 


d-K a 



(11.51) 



To prove these relations it is sufficient to calculate the variation of BC and remember that all the derivatives with 

—Oi 

respect to 9 a and ir a are understood to be right derivatives and those with respect to 9 and 7r Q are left. For example, 
a simple calculation of variation with respect to 8 a 

d R B d R C / dB dC \ 

5{BC) = SBC + BSC = ^-56 a C + B——89 a = -^-C + B— S9 a (11.52) 
v ' d9 a d9 a V d9 a d8 a I y ' 



5 The map " is assumed to be linear and all the brackets are bilinear with respect to C numbers. 
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proves the first of the identities. Having these results at hand it is straightforward to establish that the second identity 
of pi.47[) will be fulfilled if and only if a = 1, 6 = — 1 and c = 1. The final form of the GPB for two odd variables can 
then be given: 

_ dAdB dAdB dA dB dB dA dB dA OA dB 

which, together with (|II.45[) and (|IL46[) and the assumption of bilinearity of GPB, defines GPB for all the variables. 
This GPB is the same as the bracket given by the formula (4.1) of {10|, although it is not so easily visible, since the 
authors of (lOj use left derivatives, whereas I use left derivatives when differentiating w.r.t. 6*'s and 7r's and right ones 
w.r.t. #'s and 7f's. 

The conditions (|II.47|) where used to fix the parameters in the initial form of GPB ([11.48 jl , so these conditions are 
certainly satisfied by the bracket from the construction. However, it is necessary to verify that the remaining algebraic 
conditions are satisfied by GPB. All these conditions are 

[Fi,Ei]gp = — [Ei,Fi]gp : 

[Ai,A 2 ] gp = [A 2 ,Ai] GP , 

[Ei,FiF 2 ]gp — [Ei, F 1 ]gpF 2 + Fi[Ei, F 2 ]gp, 

[Ai,EiF 2 ]gp — [Ai,Ei]gpF 2 + Ei[Ai,F 2 ]gp, r 
[A^A^gp = [A 1 ,A 2 ]gpFi — A 2 [Ai,Fi]gp, (IL54) 
[Fx, [E 2 ,E 3 ] GP ] Gp + [E 3 , [F 1 ,E 2 ] GP ] GP + [E 2 , [E 3 ,F 1 ] GP ] Gp = 0, 
gp\gp ~ t^ 3 ' \E\,A2] G p\ Gp + [A 2) [A 3 ,E 1 ] GP ] GP — 0, 

[A U [A 2 ,A 3 ] 

gp\gp + I^ 3 ' [AiiA 2 \gp]gp + 1^2, [^3,Ai] GP ] GP — 0. 

where Ej's are even, -Ay's are odd and Fj's are arbitrary. These identities can be derived straightforwardly from 
the corresponding operator identities under the assumption that the bracket corresponds to the anti-commutator if 
both variables are odd and the commutator if at least one of them is even. These conditions can be written more 
succinctly if the parity index #F is introduced, which is for even F and 1 for odd F. Then the conditions (|II.54[) 
can be rewritten in an equivalent form given by eqs. (4.3), (4.4), (4.5) and (4.6) of Q3. However, when considering 
the examples, it is convenient to have them written down explicitly. 

One can verify by straightforward but lengthy calculations (preferably performed with the help of Mathematica or 
Maple) that GPB d efined by (III.45[) . pi.46[) and (III.53[) does i ndeed satisfy (|II.54p . To prove the Jacobi identities (the 
last three of ()II.54jl ). it is necessary to use (|II.31j) and fjll. 32[) . 



4- Constrained systems: the generalized Dirac bracket 

In order to handle second class constraints consistently, it is necessary to introduce classical bracket which weakly 
vanishes whenever one of its arguments is a second class constraint. Can the formula (|II.21j) for the Dirac bracket be 
adopted in the presence of odd variables? Certainly, at lest one modification is necessary, namely all Poisson brackets 
have to be replaced by generalized ones. I shall define the generalized Dirac bracket (GDB) as 

[F,G\ GD = [F 1 G] G p - [F, Xp >}gpC^'[x0',G}gp. (11.55) 

This is the bracket that will be replaced by commutators and anti-commutators. Hence, in order for the quantization 
procedure to be consistent, (|II.55[) ought to satisfy all the algebraic conditions pi.54[) (just replace GP by GDB in 
(III.54[) V I shall assume for simplicity that all the second class constraints xp have the same Grassman parity (this 
assumption will be weakened somewhat below) . Then is anti-symmetric in the even case and symmetric in the 
odd case. The matrix elements are even in both cases. Under this assumption it is straightforward to show that 
the identities pi.54[) are satisfied indeed by (III.55[) . Note however that the order of factors, as well as ordering of 
arguments of GP is important. For example, one could try to define GDB as 

[F,G]gdi = [F,G} GP + \xp,F]gpC^ [xp',G] G p (H.56) 

or 

[F,G] G d 2 = [F,G] GP - [xp-,G]gpC^'[F, Xp ] G p, (11.57) 
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both of these expressions being equivalent to pi. 551) so long as the constraints and all the variables are even. However, 
GDi will fail to be anti-symmetric in the case of %0i G odd and F even. Indeed, one gets 

[E, AjcDi + [A, E]gd 1 = 2[xp,E] GP C^ \x0>,A]gp, (H.58) 
where E is even and A and X/3 odd. GD2 will fail to satisfy the Leibniz rule for x/3 odd: 

[E, FA]gd 2 = {E,F} GD2 A + F[E,A} GD2 +2[xp>,F} GD2 C^'{E lX p}A (11.59) 

for E, F even and A, xp odd. The reader is encouraged to verify that other alternatives for pl.55|l arc not consistent 
with pT54|) . 

I assumed that all the second class constraints have the same Grassman parity. However, if these constraints can 
be separated into groups such that the constraints from different groups weakly commute with one another and all 
the constraints within a group have the same parity, then the results discussed above will also be true. This more 
general situation will occur in electrodynamics. 



III. CANONICAL QUANTIZATION OF THE DIRAC FIELD 



A. Classical Hamiltonian analysis 



The action of the theory is 

S D = J C D d 4 x, = \ $(xh a d a ip(x) - d a ^(x)j a ^(x)] - m#z)V(x). (III.l) 

Here (x) = (t, x) represents Minkowskian coordinates of flat space-time, tp can be thought of as a column of four 
complex valued functions if>i, I = 1, . . . , 4 on space-time, 7" are the Dirac matrices, ip := ip^-f is the Dirac conjugation 
(here \ denotes the Hermitian conjugation of a column matrix), a = 0, . . . , 3 is a space-time index. For fixed t, ipi{x) 
should be thought of as odd variables, so one needs to keep track of minus signs whenever the ordering of these 
fields is changed. However, consequent application of matrix notation in the calculations makes the ordering to be 
automatically correct in most cases and hence one can forget about the odd nature of "0's at the beginning of the 
analysis. 

The Dirac Lagrangian density can be written in many equivalent ways, owing to the possibility of adding divergence 
of a vector field. The choice Co differs by divergence of V a = ^j a ij} from the simplest Lagrangian density 

Csimp = ^ {rfda ~ m) Tp. (HI. 2) 

Although the two versions (as well as many others) are equivalent in flat space, they appear not to be equivalent when 
gravity, interpreted a Yang-Mills gauge theory of the Poinca re g roup, is minimally included [Tl) . A modified coupling 
procedure can be introduced that is free of this ambiguity pjj. This corrected coupling procedure is equivalent to 
the standard minimal one if Co, and not any other version of the Dirac Lagrangian, is used as a starting point for 
inclusion of gravity. Although this choice does not matter in the case of electrodynamics, I will chose Cd, since then 
the extension of our considerations to the gravitational case will be possibly straightforward. 
Calculation of the momenta in accordance with (|II.2[) gives 

(>€ i .^ y j w : ^_^ = J_ 1 ^_ (Hl.3) 



These relations does not involve time derivatives of fields and hence represent the primary constraints. In the matrix 
notation, the constraints can be written as 

Xi := 7T - ^7°, X 2 = W + ^7V (HI.4) 

Hence, xi is a row matrix, whereas X2 is a column. Strictly speaking, pil.4jl represents infinite number of primary 
constraints, which can be labeled by I and the space point x 

Xiix = ni(x) - 2^'(^)7?'iJ X2ix = ni(x) + l -{x)^ v tp v . (III.5) 
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The basic canonical variables ip, ip, tt, tt are all odd. The formulas pi.461) . (|II.45[) and (|II.53[) for GPB reduce to 

\ F n (( — SG + SG SF — 6F 6F 6G \rfl mm 

[ ' lGP J \6i>(x) 8ir(x) Sw(x) S^(x) T <ty(3) 5ir(x) Sw(x) f${x) ) ' [ ' 

where the upper sign applies whenever at least one of the variables F, G is even and the lower one corresponds to F and 
G odd. The matrix index I was omitted in favor of matrix multiplication between factors. If F is a scalar functional of 
the canonical fields, then the functional derivatives SF /Stp(x) and 5F/Sw(x) are column matrices, whereas SF /Sip(x) 
and 5F/Stt(x) are rows. To see this, consider an example of a functional 

f-Jumw**, an.?) 

where M is any matrix-valued function on space that does not depend on ip and ip. Under the infinitesimal change 
of ip, the variation of F is SF = f t/j(x)M(x)5ip(x)d 3 x, whereas the variation of F under the change of ip is SF = 
J 5ip(x)M(x)ip(x)d 3 x. It follows that 

SF — SF 

ip(x)M(x), = M(x)ip{x) (III.8) 



Sip{x) rv y v " SiP(x) 

and hence SF/Sip(x) is a row and SF /8ip{x) a column. Note that due to the matrix formalism the functional derivatives 
with respect to ip and if are automatically right derivatives and those with respect to ip and tt are left (this is why I 
chose such convention when discussing general systems with odd variables). 
The canonical Hamiltonian calculated in accordance with (|II.2[) is 

H = I H{x)d 3 x, 

- / i- \ ■ i \ - i (IIL9) 

% = nip + tp — Co = [ n ~ / ^ I ^ 2^°^ / _ wprfdjip + fntpip + —9j (ipj-'ip) > 

where j = 1, 2, 3 is a spatial index. Discarding the last boundary term, which would not contribute to the brackets of 
H with other variables, and using the constraints, we get 



H = J i>(g) (-i^dj + to) ip(x)d 3 x (III. 10) 

and 

H T = H+ [ (xi{x)u\x) + u 2 {x) X 2{x)) d 3 x, (III. 11) 



where u 1 and u 2 are correspondingly a column and a row of complex-valued functions, as yet undetermined. 

The consistency conditions for time evolution of constraints can be solved in two ways. One can utilize 

matrix formalism and use (|III.4[) . (IIII.6[) and to obtain 

[xi(x),H T ] GP = ~jit^: ~ ^^§\^° = - iu2 (xh° - idjipix)^ 3 -mip^O, 

SH T iSH , (IIL12) 

[X2{x),H T ] G p = - — +7°o^-t^V = H° ul (x) + il J djip(x) - mip « 0. 
5ip(x) 2 Stt(x) 

Clearly, these equations can be solved by the appropriate choise of u's. Therefore, no secondary constraints appear. 
The solution for m's is given by 

U 1 = -f^djip - im^ip, U 2 = -<9 3 ^7 J 7° + im^ a . (111.13) 

These expressions provide a particular solution of an inhomogeneous system of equations discussed in (|II.13|) and 
therefore uppercase letters are used (compare pi,14[) ). The general solution to the homogeneous part of pi,13[) is 
equal to in this case. 

Alternatively, instead of using matrices, one could rewrite the total Hamiltonian as 



[-iipi{x)lwdj'4 ) i'{x) + mip^x) + xusuj(x) + u 2 (x)x2isj d 3 x (III. 14) 
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and use the component form of constraints (|III.5|I . In the integrand of (IIII.14[) all the variables, together with u] and 
uf, are odd and their chronology is therefore important. When calculating the brackets of xus and X21S with the 
integrand of (|III.14|) , it is useful to use the commutation relations between the basic canonical variables 

[Wi(x), W v {x')] G p = 0, 



[ifji{x),ipv{x")] G p = [ipi(&),il>ii(rf)]GP = [iri{x),TTi<(x")} G p 
[^i(x),tti>(x)]gp = {Kit(x),ipi(x)} GP = 8 U t8{x - x 1 ), 
[■ip l (x),Wit(x / )] G p = [Wi>(x'),ijj l (x)]GP = -SiitS(x-x) 



(111.15) 



(note the minus sign in front of 5u> in the last expression) and the Leibniz rule for odd variables, [A, BC]cp 
[A, B]apC — B[A, C]gp- This results can be derived from (|III.6j) . Using them, one can calculate that 



C\lx,lVx' 
Cllx,1Vxt 
C\lx2l'x' 



— [xus, Xu'x']gp — 0, 

= [X2lx, X2l'x'}GP = 0, 

= [xiis,X21'x']gp = «7iV(a 



[X2Vx'i XUx]gP —'■ C2l'x',llx: 



[xiky, H T {x)} G p = j (iil)i(x)^ lk dj5(x - y) - (mip k (x) + ijf k uf(x)) 5{x ~ y) 
[X2ky,H T (x)} G p = J {ili v djil)it{x) - mip k {x) + i^u] (xjj S(x~y)d 3 x. 



d 3 x, 



(111.16) 



After the integrations are performed, the last two expressions reduce to (|III.12|) . When writing djS(x—y) I always mean 
the differentiation with respect to first variable, i.e. djS(x — y) = (djS)(x — y) = d/dx 3 S(x — y) — — d/dy 3 S(y — x) — 
—djS{y — x). So, unlike S, the function djS is odd in the sense that djS(-x) = —djS(x). The only nontrivial brackets 
that have to be evaluated in the calculations above are the brackets of the fields with the derivatives of their canonical 
conjugates, such as 



[^k(y),d 3 ipi(x)]GP = 



Sdjipijx) 
Hk{y) 



-S k idjS(y- x) = S k idjS(x - y). 



(III. 17) 



The easiest way to obtain this result is to write the derivative as djtpi(x) — J S(y — x)5ikdjipk{y)d 3 y and then variate, 
Sdjipi(x) = J 5(y — x)5ikd / dyi 5ipk(y)d 3 y — — J d / 'dy 3 S(y '— x)SikStpk(y)d 3 y, where in the last step the boundary term 
was omitted. 

Equations of motion 

Since the first class constraints are not present in the system, it follows that 



Ht — Hi 



H' = H+ / ( X i(g)U 1 {x) + U 2 {x) X2 (g))d 3 x 



ip (— ij 3 dj + m) tp — ( n — -^4>1^ j 7° {l^djip + imtp) — (djip~f 3 — irmjj) 7 f it + 



In the last form I dropped the argument x for simplicity. The equations of motion are 

ip = [tp, H']gp = — 7 (jy 3 djip + miff) , 
tp = [tJj, H']gp — — [djtpj 3 — imip) 7 , 
7t = [it, H']gp — —djnj ^ 3 + imn"/ , 
W = [7r, H']gp = — 7 J 7 djW — imj°W. 



(111.18) 



d 3 x. 



(111.19) 



The equalities of the form [f,H']cp = g, where / and g are functions defined on space, should be understood as 
equalities of functions, i.e. Vx, [f(x), H'] = g{x). The equations of motion (IIII.19[) need to be supplemented with the 
constraints. It is easy to see that all the equations then reduce to the Dirac equation 



{i^da -m)i) = 0. 



(111.20) 
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B. The generalized Dirac bracket and the equal time anti— commutators of field operators 

Having the matrix Cng^i'S' given by (|III.16[) . one can seek for its inverse by imposing the conditions 



4 4 

V=X k' = l J 

4 4 

El s-ilkxAk'x /~i j3 „t \ ^ / /~i2kx \2k' x /~i 

I l> <^lk'x',2ly a X — I ° <^2k'x 



S k i5{x-y), 



(111.21) 



>x>,llyd 3 x' = 0. 



A unique solution is given by 



and the generalized Dirac bracket is 



q11x,21'x" _ q2V ' x' ,llx 



-i-Y?i>6(i 



(111.22) 



[F, G] GD = [F, G] GP - E / / XiixhpC 113 ' 21 '*' lx2Vx>,G] GP + [F, X 2Vx>]gpC 21 ' s ' ' lls [ X im G\gp) d 3 xd 3 ~ 
= [F,G} G p+J2hu' / ([F,Xux}gp[X21>x,G} gp + [F, X 21>x}gp[xiix,G} gp ) d 3 x 

LI' 



with xiix an d X21S given by pil.5|) . The brackets of basic canonical variables can now be computed 

4>i(x),ipi>(x) 
-6u'6(x- x') 



(111.23) 



1pl(x),1pl'(x')]GD = [tTi(x),TTI'(x')]gD = [tpi(x),1p l ,(x')] G D = [lT i(x) , TT V (x' )] G D =0, 

1 

2< 

1 



ipi(x),iri>(%')]GD 
ip l (x),Wi>(x , )] G D 



i>i(x),iri>(x , )] GD : 

1pl(x),1p l ,(x')] G D 

■ni{x),Tfv(x')] GD ■ 



[iri'(x"),ipi(x)] G D 

[n>[x'),'tpi(x e )) GD 
bPi(x),n'(x)} GD 



-Su'S(x- x"), 



(111.24) 



= 0, 



[iPi>(x),ipi(x)} GD = ~i^u,S(x - x), 
[ni,(x , ),ir l (x)] GD = -j1u>S(x- x*). 



The equal time anti-commutation relations of field operators can now be readily established according to the procedure 

[F,G\ + = i\F\G} GD , (111.25) 

which is the generalized version of pi. 441) (note that GP is now replaced by GD). The second class constraints (|III.4[) 

are interpreted as strong operator equations in quantum theory and hence not all the basic field operators ij), ip, tt, 

7f are independent. For example, one could chose to use tp and ip as independent and interpret the remaining basic 
operators as derived from these in agreement with the constraints. Only the anti-commutation relations of ip and tj) 
are then necessary 



■0/(f),^'(^') 



0, 



iJi{x),i>v{x') 



0, 



QmM*) =iU{x~x'). 



(111.26) 



All the other anti-commutators can be derived from (|IIL26|) . The reader is encouraged to verify that the results thus 
obtained agree with those that could be derived directly from (jIII.24|l through (|III.25jl for each bracket separately. 
This follows from the fact the the Dirac bracket is consistent with the second class constraints. 



C. How the anti— commutation relations should not be derived 



In many treatments of quantum field theory (e.g. [8] [3]), the canonical analysis of the Dirac field begins with the 
introduction of the Hamiltonian formalism for field theory of systems that do not contain neither odd variables nor 
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constraints (usually on the example of the Klein-Gordon field). Then the simplest Lagrangian density (|III.2|) for the 
Dirac field is introduced, for which one of the constraints 

7T = i^7° (111.27) 

is derived. The conclusion is then drawn that ip is not an independent field, but rather a function of ir. 

Although the references mentioned do not do this, it may be tempting, especially for a student who has not yet been 
introduced with the formalism of constraints or odd variables, to proceed with the derivation of the anti-commutation 
relations in the following way: The usual PB for tp and ip can be calculated, under the assumption that tp is expressed 
through 7r according to (|III.27|) . One could then try to obtain the anti-commutation relation for field operators via 
(|III.25|1 . with GDB replaced by PB. The result thus obtained coincides with pil.26[) . Have we just circumvented the 
formalism of constraint systems and odd variables? Is then the concept of generalized Dirac bracket really necessary? 

Although this "derivation" yields correct anti-commutation relations (|III.26j) . it is important to stress that its 
correctness is conditioned by particular choice of Lagrangian density from the class of equivalence (i.e. £ S imp pil.2|) 
and not, say, Cd) pil.ljl ). as well as particular ordering of arguments of the brackets (if the anti-commutation relation 
for tjj and "0 was established in a way described above, then the result would obviously differ from the correct one by 
sign, due to the anti-symmetry of PB). 



D. Non— equal time commutation relations 



The non-equal time commutation relations for causal fields in flat Minkowski space are usually derived in a way 
that does not employ canonical formalism at all, but instead relies heavily on the Poincare symmetry (see Q or fl3|). 
On the other hand, the derivation of pil.26[) presented here does not invoke the Poincare symmetry at all. This 
is of importance, since the frequent usage of this symmetry in QFT is one of the most important obstacles for the 
straightforward extension of its apparatus to the case of curved space-time, which may not posses any symmetries 
at all. One of the aims of this article is to show how far one can go with the quantization by using the canonical 
formalism and not the space-time symmetries. 

To derive non-equal time anti-commutation relation for the Dirac field and its conjugate, consider the two space- 
time points (x) = (t, x) and (x 1 ) = (t + r, x'). Assuming the analyticity of tp in t, the GD for tp(x') and ip(x) can be 
represented by a series 

[^(x'),^ l {x)] GD =[^(t + T,x"),^ l (t,x)] GD = J2— ^(MO^zM) (HI.28) 

n=0 



GD 



where ip( n ' is the ro-th derivative of ip with respect to time. In order to shrink the series into something finite and 
simple, it is helpful to be organized. I shall now prove the 

Theorem III.l. The following relation holds 

tp^(t,x)=B n ip(t,x), (111.29) 

where B n denotes the n-th power of the matrix-differential operator 

B := -7°7 J 'Sj - im 7 °. (111.30) 

Proof. I shall use induction. For n = the result is trivially true and for n = 1 it follows from the first equation of 
PII.19[) . I will show that for each n £ N the inductive assumption 

^ nS >{x)=B n il){x) (111.31) 

implies that 

= B n+1 <ip(x), (HI.32) 

Note that 



(111.33) 
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In the second step the inductive assumption (MI. 311) was used. Since 

B n i()(x) = { 5(z-x)B n i;(z)d 3 z= { B"6{z-x)^(z)d 3 z, 



(111.34) 



where X denotes the differential operator obtained from X by the change of signs in front of the terms that are of 
even rank in derivatives (e.g. B = ^^dj — irwy ), it follows that 



SB n ip{x) 



B n S(z-x) 



Inserting this result to (|III.33|) leads to 
which proves the thesis (|III.32I) . 



(111.35) 



(111.36) 



□ 



Let us now calculate the bracket that occurs under the summation sign in (|III.28|) . Using (|H.55j) with x replaced 
by z and I, I' by k, k in order to avoid conflicting indices, one gets 



GD 



4 n) (f),^(f) 

\p\:\x")^ k (x) 



illk' 



GP 



GP 



'Hi 



GP 



, ^V) . a 5{B^{x')) v 



(111.37) 



Stpk(x) 



8ipk{x) 



Using pil.35|) and B n 5(z — x) = B n 8(x — z), which holds on account of the fact that even derivatives of the Dirac 
delta are even functions and odd derivatives are odd functions, one finally gets 



(111.38) 



In order to shrink (|III.28[) to the finite expression, I need to prove yet another 
Theorem III. 2. The following identity holds 



T 

i V -rB n 5(f)7° - -* (yfd a + m) A(r, x), 



(111.39) 



whe 



A(t,x) 



dT v U-^e^ - e iE v t e - t px\ 



dT n 



d 3 p 



(2ir) 3 2E p 



p + m 2 



(111.40) 



Proof. It is important to stress that the three-vector p — (p 1 , p 2 , p 3 ) is just an integration variable, which could be 
called in any other way, i.e. no interpretation of p as particle's momentum is necessary. To prove the theorem, I will 
first introduce the 



Lemma III. 3. For any k £ N the following identity holds 

(djdj - m 2 ) k S(x) = iA< 2fc+1 )(0,f). 
Proof. Straightforward calculation shows that 

(djdj - m 2 ) (e lp ~ + e" lp ") = {iE p f (e lp ~ + e~ lp ") , 

which easily generalizes to 

(d^ - m 2 ) fe (e l P s + e- lps ) = {iE p f k (e lp ~ + e~ 1 ^) . 



(111.41) 
(111.42) 
(111.43) 
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Decomposing the Dirac delta as 



5(x) 



2(2vr) 



(e ips + e~ lps ) d 3 p, 



acting on it by (djdj — m 2 ) k and using ([III. 431) yields 

{djdj - m 2 ) k 8(x) =~ij (iE p ) 2k+1 (e 4p ~ + e~^) dT p . 
On the other hand, differentiation of ()IIL40|) gives 

A (2fc+1) (i,:r) = - J (iE P ) 2k+1 ( e - iS *V^ + e^e -4 *®) dT p 
which, for t = 0, coincides with (|III.45|) . thus proving (|III.41|) . 



(111.44) 
(111.45) 

(111.46) 
□ 



Having the Lemma proved, it is easy to prove the Theorem by summing (|III.39[) over even and odd values of n 
separately. Since 



B 



-rf^didj — m 2 — djdj — m 2 



(111.47) 



(use [7 a ,7 b ]+ = 2rj ab , where r? = diag(l, -1, -1, -1)), it follows that 

B 2k 5{x) = (djdj - m 2 ) k S(x) = iA( 2k+1 \0,x), B 2k+1 S(x) = Bi/\ {2k+1) [Q,x) (111.48) 
and the LHS of (1ITL391) is 

' i — — A, — 

= £ 7 °A(t, f) + A(r, f) 7 ° = -i (i 7 a 9 a + to) A(r, x ). 



(111.49) 



□ 



Insertion of (|III.38|I into (|III.28j) and subsequent application of Theorem (|HI.2|) gives non-equal time anti- 
commutation relation 



or, equivalently, 



^i(x),i/j 1 ,(x') = {ij a d a + m)wA(x - x) 



Mx),® t (x')] = {(i 1 a d a + m) 1 } lll A(x-x'). 



(111.50) 



(111.51) 



It is important to stress that the formula (|III.28j) already shows that the form of non-equal time anti-commutation 
relations is fully determined by the canonical Hamiltonian formalism. Neither the Lorentz symmetry of the Dirac 
field nor the symmetries of space-time are needed to obtain this relations. All the work done afterwards was aimed to 
simplify the formula for the anti-commutator of ip and ip and here the symmetries were certainly helpful. However, 
even if that kind of simplification was not possible for technical reasons (e.g. in the generic curved space), the series 
formula similar to (|III.28p would still be obtainable and could be used in principle to finally compute measurable 
quantities from the theory. 



E. Quantization 

To quantize the theory, it is necessary to established the (anti)commutation relations between the basic field 
operators and all the other operators that represent physically important observables. Then it only remains to find 
a representation of these relations in a Hilbert space. The anti-commutation relations between the basic canonical 
operators have been established. The most important observable is the energy represented by the Hamiltonian. But 
which Hamiltonian? Since there are no first class constraints in the system, only H' and H are at our disposal. But they 
differ by second class constraints which strongly vanish in the quantum theory. Hence, the operators corresponding 
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to H and H' are equal. Recall from (III.22[) that the equations for time evolution 19[) can be rewritten in terms of 
GDB, instead of GP, e.g. 



iP^[iP,H] 



GDB 



-7 {pj 3 djip + mip) 



(111.52) 



(the prime at H was omitted, since the GDB of any variable with a linear combination of second class constraints 
weakly vanishes). Then the application of (|III.25|) and setting second class constraints to zero yields the operator 
version of the Dirac equation 



{i^da - m) V = 0. 



(111.53) 



The equation is sufficiently simple that a general solution can be given. This is very pragmatical, since then, if 
ip assumes a form of general solution to pil.53[) . the commutation relations of ip and ip with the Hamiltonian are 
automatically satisfied and it only remains to impose the equal time anti-commutation relations of fields pil.26[) (and 
also the commutation relations of fields with other observables such as momentum and electric charge, which I shall 
not discuss here). 

Acting by ij b db + m on the LHS of (|III.53|) yields the Klein-Gordon equation, (□ + m 2 )ip(x) = 0, from which it 
follows that ip is necessarily of the form 

i/j(x) -- 



-ip-x 



A{p) + e^Bip)) dT p , 



(111.54) 



where p ■ x — p°t — p ■ x. Here and below in this article it is assumed that p° — E p (we are on the mass shell) . For 
fixed p, think of A(p) and B{p) as four-component columns whose entires are operators on a Hilbert space which has 
not yet been defined. The necessary and sufficient conditions for (|III.54[) to provide a solution to (|III.53|) are 



a) (0-m)A(p)=O, 

b) (tf + m)B(p)=0, 



(111.55) 



where jp = p a ^ a - For fixed p, the space of solutions for these equations are given by two-dimensional subspaces 
of C whose bases will be denoted by u(p,a) for a) and v(p,a) for 6), where a numbers the two basis vectors and 
conventionally takes values 1/2 and —1/2. The general solution of (|IIL53[) can thus be written as 



ip{x) 



'■ x u„{p)a a {p) + e^ x v a (p)a c J(p)) dT p , 



(111.56) 



where a a (p) and a c J (p) are now completely arbitrary operator-valued functions of p. Although these functions are 

arbitrary from the viewpoint of the Dirac equation, the anti-commutation relations for tp, ip, ip and ip that follow 
from the canonical formalism uniquely determine the anti-commutation relations between a a {p) 1 cl c ^{p) and their 
Hermitian conjugates. To see this, note that (|III.56[) can be inverted to yield 



x (e p $(x) + ipj{x)^j d 3 x 
r ip - x (Epipix) - i#(s)) d 3 x 
r ip - x (e p ${x) - ift(x?) d 3 x 
ip ' x (E p ft(x)+ift{x)) d 3 x, 



(111.57) 



Uvfflacrip) - 

Va{p)a c J{p) 

where a c (p) := [a c ^{p)^ and * means complex conjugation. Since the basis vectors u a {p) for different values of a are 
linearly independent (the same concerns v, u* and v*), these equations specify uniquely the coefficients a a (p), a^(p) 
and their conjugates in terms of field operators whose anti-commutation relations are known from the canonical 
analysis. One can seek for the choice of bases u and v which makes the anti-commutation relations between a's 
particularly simple. For example, the choice proposed in [13j leads to 



a><r(p),a>i>(p'] 



a c a (p), a c J,(p') = (2*f2E p 5 aa ,5{p-p'), 



[a CT (p),ao-'(p')]+ = K(P)X'(P")]+ = a a (p),a c J,(p') = a%{p),a\,{p') 



(111.58) 
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These relations are so simple that they can be readily represented in the appropriate Hilbert space of many-particle 
states by the Fock quantization, which is described in all textbooks on quantum held theory. The commutation rela- 
tions of a's with the Hamiltonian support the interpretation of a£(p) and a c J(p) as the operators that create particles 
of energy E p and their conjugates as annihilation operators. If the commutation relations with all the components of 
the energy-momentum tensor were investigated, the interpretation of the argument p as particle's momentum could 
be supported. The index a could be interpreted as the projection of particle's spin on the quantization axes in the 
particle's rest frame if the commutation relations of a's with the components of spin density tensor were inspected. 
Finally, establishing the commutation relations with the electric charge operator allow for the interpretation of the 
superscript c as denoting anti-particles. 

The reader might feel that the Poincare symmetry that had been avoided so conscientiously throughout was finally 
employed. After all, both the energy-momentum tensor and the angular momentum tensor (including spin density 
part) are composed of Noether conserved currents that are related to the Poincare symmetry of the action. I have 
just written that the commutation relations of creation and annihilation operators with momenta and spin operators 
are needed if the physical interpretation of p and a is to be established. 

Although it is certainly true that the conserved quantities mentioned above can be obtained by Noether procedure 
from the Poincare global symmetry, it does not mean that they can not be obtained differently In general relativity, 
the energy momentum tensor of matter t PlV is conventionally obtained by varying \/ \det{g)\S m with respect to the 
inverse space-time metric g^ v , were Sm is the matter part of the action. This procedure is well defined in any space- 
time, no matter what symmetries it might posses, and yields the results that are compatible with Noether-Bellinfante 
method in flat space. There is a problem with spin density, though. If, however, gravity is interpreted as a Yang-Mills 
theory of the Poincare group, than both the energy-momentum tensor and spin density tensor can be obtained by 
varying the matter action with respect to the tetrad and the connection (see [1JJ for the details). Hence, the Poincare 
symmetry of space-time is not really necessary. All the steps of the canonical quantization of the Dirac field performed 
in this article could be accomplished in principle in curved space as well. What is really problematic in curved space 
are all the technical complications. For example, it is not possible to find explicit solutions to the Dirac equation in 
most curved space-times. 

IV. CONCLUSIONS 

The general canonical formalism described in Section U was successfully applied to the theory of the Dirac field 
in Section IIII1 It was argued that only after the Grassman odd variables are introduced the consistent canonical 
quantization of fcrmionic fields may be possible. Also, the Lagrangian for the Dirac field was shown to lead to the 
presence of constraints in the theory, which altogether made the application of a concept of generalized Dirac bracket 
necessary. It was argued that the Poincare symmetry or the causal structure of space-time does not have to be 
involved in the program of quantization if the canonical method is consequently followed. 
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